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(57) Abstract 

The invention relates to a 
protein stabilizing sequence parti- 
cularly useful for stabilization of 
proteolytically sensitive proteins. 
The sequence includes a relatively 
small number of amino acids that 
may be expressed fused with a pro- 
teolytically sensitive protein. The 
most effective stabilization se- 
quences assume a-helix structures 
with a hydrophobic face and a posi- 
tively charged polar face which ap- 
pear to require proper orientation 
with respect to each other. Other 
aspects of the invention include 
cloning vectors incorporating a 
gene sequence encoding the stabili- 
zation polypeptide and production 
of stabilized antigenic proteins. 
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DESCRIPTION 

SEQUENCE FOR STABILIZING PROTEINS IN BACTERIA 
PROTEIN STABILIZATION SEQUENCE 

The United States Government may have certain rights 
in the present invention pursuant to the terms of Grant 
No. 86-13978 awarded by the National Science Foundation. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The invention relates to an amino acid sequence 
useful for stabilization of otherwise unstable proteins, 
particularly proteolytically sensitive proteins. The 
invention also relates to methods of producing stabilized 
proteins by direct attachment of the stabilizing sequence 
or through fusion proteins expressed from recombinant 
host cells. 

Description of Related Art 

One of the major problems in the use and preparation 
of therapeutic and commercial proteins is degradation by 
cell proteases. This may occur in vivo in plasma or 
inside a host cell when recombinant methods of 
polypeptide production are employed. Degradation plays 
an obvious role in the elimination of damaged or abnormal 
proteins but also affects half lives of normal proteins. 
Individual protein turnover rates may vary 1000 fold, 
depending on environmental and structural factors. 

Two general factors contribute to the half-life of 
any given protein. One factor includes "global" features 
such as large size, hydrophobicity , thermal instability 
and charge characteristics. The other factor is related 
to sequence specific parameters of particular a-amino 
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terminii, asparagine residues and so-called PEST 
sequences. Although correlations have been made between 
particular features and susceptibility to degradation, 
explicit degradation signals and pathways are not well 
defined. It is not even known whether peptide bond 
cleavage or biochemical marking is the primary event 
signaling protein destruction. 

In contrast to degradation factors, there appear to 
be stabilization factors that enhance stability of a 
protein toward proteolysis. It has been suggested that 
both thermodynamic stability and proteolytic 
susceptibility are major determining factors in rate of 
degradation (Parsell and Sauer, 1989) . There has been 
some evidence that particular sequences at the N-terminus 
of lambda repressor in Escherichia coli aid in 
maintaining a stable tertiary structure and therefore 
slow proteolytic degradation (Parsell and Sauer, 1989) . 
Some evidence also suggests that the increased stability 
of certain fusion proteins may derive from the ability of 
a stable fusion partner such as protein G or 
£-galactosidase to stabilize the tertiary structure of a 
proteolytically sensitive protein (Hellebust et al., 
1989) . 

A particular amino acid sequence attached as a 
"tail" to the C-terminal of Arc, a small dimeric 
DNA-binding protein found in bacteriophage P22, has been 
found to stabilize that protein against proteolysis. The 
primary sequence of the tail consists of 25 amino acids 
which when attached to a heterologous protein, the LP57 
mutant of X -repressor in Escherichia coli, also 
stabilized that protein (Bowie and Sauer, 1989) . 

Nevertheless, up until now, an amino acid sequence 
attachable to a proteolytically sensitive protein has not 
been found which will generally increase stability of 
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these proteins. The value of such a sequence would lie 
in its ability to increase the half-life of commercially 
desired proteins for production in recombinant cell hosts 
and the potential to protect proteinaceous vaccines from 
serum protease degradation when injected directly into an 
animal. 



SUMMARY OF THE INVENTION 

The invention generally relates to a protein 
stabilization sequence. The sequence is comprised of 
amino acids that may be attached to proteolytically 
sensitive proteins. 

The stabilization sequence includes a limited number 
of amino acids ranging from about ten to about 50 
residues. The amino acids need not be confined to a 
particular type of amino acid; however, the amino acid 
residues will be such that the secondary and tertiary 
structure assumes the form of an outwardly directed, 
properly aligned hydrophobic face and a positively 
charged polar face. The importance of this structure has 
been shown by substituting two of the amino acids causing 
the structure to have a twisted form. This form was 
shown to be less stable. The alignment of amino acid 
groups within a particular a-helical form appears to be 
important in order for the amino acid sequence to have 
maximal effect as a stabilization sequence. 

The a-helical structure assumed by the amino acid 
protecting sequence is an important aspect of the 
invention. Thus it is number and type of amino acid 
residues involved, and the structure assumed by the amino 
acid residues making up the structure. Twisted forms of 
the a-helix structure with the hydrophobic and polar 
faces out of alignment provide less stability to fused 
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heterologous proteins. Thus in preferred embodiments the 
outwardly directed hydrophobic face of the structure is 
in alignment with positively charged face of the cr-helix. 

Although it is believed that such stabilization 
structures could be formed with as many as a 100 amino 
acid residues, the amino acid sequence used is preferably 
from about 10 to about 50 acid residues and more 
preferably 29 amino acid residues. A particularly 
preferred amino acid sequence is shown in Figure 1. 

Another aspect of the invention is the attachment of 
the protein stabilization sequence to a proteolytically 
sensitive enzyme. Such attachments may be made at the 
time of synthesis of the proteolytically sensitive 
protein of interest or alternatively may be used in in 
vitro situations where it is desired to protect a 
proteolytically sensitive protein from degradation. An 
example might include the use of the protecting sequence 
to protect an antigenic protein that is being used to 
promote antibody formation in vivo. Protection of the 
antigen from serum proteases would provide more effective 
use of expensive antigenic materials that are in short 
supply. 

In a preferred embodiment, the protein stabilization 
sequence is prepared in the form of a cassette in which 
the DNA segment encoding the sequence is incorporated 
into a cloning vector. Proper construction of the vector 
results in a protein of interest being expressed with the 
polypeptide protecting group attached to the protein, 
thus protecting it from protease degradation within the 
host cell in which it is expressed. 

Attachment of the protein stabilization sequence is 
generally to the amino-terminal end of the 
proteolytically sensitive protein or to the carboxy 
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tenainal end. In some cases, for example, exceptionally 
long proteolytically sensitive proteins it may be 
desirable to attach the stabilization sequence to both 
termini. In vitro attachment methods may be readily 
accomplished using chemical attachments well known to 
those of skill in the art. Attachment of such sequences 
in vivo is possible by constructing appropriately 
designed cloning rectors. 

The protein stabilization sequence described has 
general utility and is not limited to stabilization of 
the polymerase sigma factor from which it was originally 
derived. RNA polymerase sigma factors from B . subtilis 
or E. coli have been prepared as fusion proteins with the 
stabilization factor originally isolated from B . subtilis 
sigma factor. It is envisioned that virtually any DNA 
segment encoding a polypeptide could be constructed in a 
cloning vector or expression vector with the DNA segment 
encoding the stabilization factor sequence. Examples of 
other proteolytically sensitive proteins that might be 
protected with the stabilization sequence include 
proinsulin, epidermal growth factor, interleukin, 
interferon, somato tropin, insulin-like growth factor, 
phosphatase, immunoglobulin Fv and the like. Moreover 
expression may be from a variety of suitable bacterial 
cells provided that they can be suitably transformed by 
an appropriately constructed cloning vector. Examples of 
suitable host microorganisms include E. coli, B . 
subtilis, S. tymphyrium and the like. 

The invention is also envisioned as a method of 
enhancing stability of recombinant proteins. A first 
step would be to obtain a gene segment encoding the 
protein stabilization sequence. Such a sequence is not 
limited to a particular number or type of amino acid 
residues provided that the residues form the appropriate 
structure having the stabilization properties. Thus 



WO 93/03156 



PCT/US92/05810 



-6- 

under certain conditions it may be desirable to encode up 
to 100 amino acids in the sequence. In a further step a 
recombinant vector is prepared . This vector will have a 
gene encoding the protein stabilization sequence selected 
as well as a gene encoding a protein desired to be 
stabilized. Generally, this is accomplished by preparing 
a cloning site within a plasmid vector that encodes the 
protein stabilization sequence, then inserting into that 
site a gene or a gene segment encoding the protein 
desired to be stabilized. The cloning site may be 
modified to include one or more unique restriction sites 
but in general will be constructed so that insertion of 
the gene of choice will restore a proper reading frame 
between the protein stabilization sequence and the 
cloning site through the desired gene segment. Such a 
recombinant vector may then be used to transform a host 
cell. Transformed host cell colonies will then be 
selected, for example by incorporating selected genes 
within the recombinant vectors such as antibiotic 
resistance genes or alkaline phosphatase expressing genes 
which will elicit positive or negative responses 
depending on whether the desired gene has been cloned 
into the cloning site. Once transformed host cell 
colonies are identified, expression of a heterologous 
protein fused with the af oredescribed protein 
stabilization sequence may be affected. This is 
generally performed using standard culture methods well 
known to those of skill in the art. 

Expression of a wide variety of heterologous 
proteins fused with the protein stabilization sequences 
should be possible. In general proteolytically sensitive 
proteins suitable for production as stable heterologous 
proteins from prokaryotic hosts will be proteins that are 
expressible as active fusion proteins. Such proteins 
include B-galactosidase, sigma factor and the like. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows an amino acid sequence capable of 
forming a protecting hydrophobic faced a-helix structure 
when attached to a proteolytically sensitive protein. 

Figure 2 shows the nucleic acid sequence encoding a 
29 amino acid protective sequence. 

Figure 3 illustrates generally the oe-helix formed by 
amino acid sequences that confer proteolytic stability on 
proteins to which they are N-terminally attached. A 
hydrophobic face and a positively charged polar face are 
aligned on the a-helix. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Materials and Methods 

Bacterial Strains and Plasmids 

B. subtilis strains JH-642 (pheA trpC2) and 1A287 
(rpsl) were obtained from J. Hoch and the Bacillus 
Genetic stock Center (Ohio State University) , 
respectively. SR104 has an internal deletion in sigE. 
It was created by transformation of strain SMY with DNA 
from CM01 (trpC2 pheAl AsigE: zerrn) . The M13 host strains 
CJ236 (dut ung) and JM103 (dut+ ungr+) were obtained from 
V. Deretic (University of Texas Health Science Center at 
San Antonio) . Escherichia coli strain TB-1 was provided 
by D. Kolodrubetz (University of Texas Health Science 
Center at San Antonio). Plasmids pSR-5 (spoIID: :lacZ) 
(Rong et al . 1986), pSI-l {pSpac) (Yansura and Henner, 
1984), pGSIIGll (sigE) (Stragier et a J . , 1984), and 
pSGMU31 (Errington, 1986) have been described, pJM102, 
obtained from J. Hoch, is pUC18 (Yanisch-Perron et al . , 
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1985) with a chloramphenicol acetyltransf erase (cat: 950 
base pairs [bp]) gene cloned into its unique Ndel site, 
pSGMU3lA2 was derived from pSGMU31 by cutting the plasmid 
at its two BamHI sites and single Bglll site and 
relegating the two major fragments. This results in loss 
of a small, nonessential BamHI-Bglll fragment (150bp) and 
the creation of a single BamHI site in the vector. 
pSGMU3lA2 has the BamHI site and the lacZ gene of the 
original plasmid downstream of the plasmid 's lac 
promoter. pSR22 is pBR322 (Bolivar et al., 1977) and 
pUBHO (Gryczan et al., 1978) joined at their BamHI sites 
plus a 1.1-kbp fragment encoding sigE (Stragier et al. 
1984) . The sigE fragment was cloned into the EcoRI site 
of pBR322 by using EcoRI linkers. pSR51 and pSR54 are 
variants of pSR22 with missense mutations in the coding 
sequence of sigE. 

Oliaonucleotide-Directed Mutagenesis 

The 1.1-kpb PstI fragment of pGSIIGll was cloned 
into M13mpl9 (Yanisch-Perron et al . 1985), and its 
orientation within the vector was determined by 
restriction endonuclease analysis of RF DNA. To 
construct sigEA84, an oligonucleotide (60-mer) that 
carried the sequence 30 bases upstream of and including 
the ATG codon (nucleotides 179 to 181 of the 1.1-kpb 
fragment) of P 31 and 3 0 bases downstream of the a E amino 
terminus (nucleotides 266 to 268) was synthesized. The 
60-mer was hybridized to single-stranded M13 DNA 
containing its complementary sequence within the cloned 
1.1-kpb PstI fragment. The P^-specific sequence was 
"looped out" as a consequence of using DNA polymerase I 
to synthesize a complement to the circular DNA by using 
the oligonucleotide as primer. Enrichment for the 
deleted sequence was accomplished by Si nuclease 
treatment of the 84-base loop following the 
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polymerization reaction. The hybridization, 
polymerization, and SI treatments were accomplished as 
described by Adelman et al. (Adelman et al. 1983). 
Following transformation of E. coli JM103, mutagenized 
clones were screened by restriction endonuclease analysis 
for reduction in the size of the cloned PstI fragment 
(i.e., i.i kbp to 1.02 kbp) and the loss of the Hindu 
site originally present in the loop region. 
Approximately 5% of the resulting clones had the desired 
mutation. 

Additional mutant clones sigEA81, sigEA48 , sigEA2 52, 
and sigrEA2 53 were constructed by using oligonucleotides 
of 30 bases (30-mers) (Kunkel, 1984). M13mpl8 containing 
either the original l.l-kbp sigE fragment for si gEA81 and 
sigEA84) or 1.02-kbp sigEA84 fragment (for sigEA252 and 
sigEA253) was plaque purified twice on E. coli CJ236 (dut 
ung) and grown on this host to incorporate uracil into 
the phage DNA. Following hybridization to a 30-mer, 
Sequenase:2.0 (United States Biochemical Corp., 
Cleveland, Ohio) was used to synthesize the complementary 
strand. Selection for the strand polymerized in vitro 
was accomplished by transforming and plating the 
polymerization mixture on E. coli JM103 (dut+ ung+) , 
which selects against the uracil-containing template. 
From 50 to 90% of the clones arising from the reaction 
contained the desired mutations. These were identified 
by size changes in the cloned B . subtilis fragment and by 
DNA sequencing. 

Strain Construction 

The E. coli lac gene was translationally coupled to 
sigE by cloning a 34 0-bp Pstl-SauJJJAl fragment of the 
l.l-kbp clones sigE gene into Ps tl-BamH J-cut pSGMU31A2. 
This coupled the 5' end of JacZ in frame to sigE at a 
site 165 bp downstream of the sequence encoding the amino 
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terminus of P 31 . In a similar fashion, the 256-bp 
Pstl-SauIIIAl fragment from sigEA84 was cloned into this 
vector. This construct was formed at the same site 
within sigE but had only 81 bp of the sigE protein-coding 
region upstream of lacZ. JH642 was transformed with 
pSGMU31A2 containing the coding sequence for either the 
sigE+- or sigEA8 4 -encoded amino terminus. Clones in 
which the plasmid had integrated into the B. subtilis 
chromosome were selected by chloramphenicol resistance 
(2.5 ng/ml) and screened by Southern blot analysis for 
integration into the sigE locus with the proper sigE 
allele positioned upstream of lacZ. 

Strains of JH642 in which the wild-type sigE locus 
had been replaced by mutant sigE alleles were constructed 
by congression of the mutant alleles, cloned as a 1- to 
1.1-kbp PstI fragment in JM102 f with a streptomycin 
resistance (Str 1 ) marker from B. subtilis 1A287. Str r 
colonies were screened for Spo" by failure of the colonies 
to run brown on DS (Schaeffer et al., 1965} plates after 
24 h at 37°C. Spo" clones were then screened for Cm* to 
identify those in which the mutant allele had entered the 
chromosome by gene replacement rather than integration of 
the entire plasmid. Chromosomal DNA from clones that met 
these criteria were screened by Southern blot analysis 
for the predicted gene replacement. 

Analysis of Extracts for P 31 and cr E -like Proteins 

B • subtilis cultures were grown in DS medium, cells 
were harvested, and protein extracts were prepared as 
described previously (Trempy et al. r 1985), except that 
the ammonium sulfate step was omitted. Protein samples 
(100 tig) (determined by the Coomassie method, Bio-Rad 
Laboratories) were precipitated with 2 volumes of cold 
ethanol, suspended in sample buffer, and fractionated on 
sodium dodecyl sulf ate-polyacrylamide gels (12% 
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acrylamide) . Following electrophoretic transfer to 
nitrocellulose and blocking of the nitrocellulose with 
Blotto, the protein bands were probed with an anti-P 31 /a c 
monoclonal antibody (Trempy et al., 1985). Bound 
antibody was visualized by using a horseradish 
peroxidase-conjugated goat immunoglobulin against mouse 
immunoglobulin (Hyclone Laboratories, Inc.) or 125 I-rabbit 
anti-mouse immunoglobulin antibody (K. Krolick, 
University of Texas Health Science Center at San 
Antonio) . 

fl-Galactosidase Assays 

B. subtilis strains carrying either pSR5 or an 
integrated pSGMU31A2 plasmid were grown in DS medium and 
harvested at various times during growth and sporulation. 
Cells were disrupted by passage through a French pressure 
Cell (twice at 20,000 lb/in 2 ) and analyzed for 
B-galactosidase activity as described by Miller (Miller, 
1972) . 

DNA Sequencing 

DNA sequencing was performed by the method of Sanger 
et al. (1977) with the Sequenase reagents (U.S. 
Biochemical Corp.) and the protocol provided by the 
manufacturer. 

Construction of the si gVARd aIIpIp 

The sigE gene was reconfigured to encode a a E -like 
protein as its direct product. A single-stranded 
oligonucleotide was hybridized to a region within a 
single-stranded copy of this fragment (cloned in 
M13mpl9) . This resulted in the looping out of the DNA 
that encodes the precursor element of P 31 . A complement 
to the M13-containing DNA was synthesized by using the 
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oligonucleotide as a primer. Following second-strand 
synthesis and destruction of the loop by Si nuclease, the 
DNA mixture was introduced into E. coli by 
transformation. Cloned DNAs which both lost the Hindll 
site that is unique to the precursor portion of the 
sequence and contained a PstI fragment of approximately 
1.020 bp (rather than the parental 1.100 bp) were picked 
as putative sigE mutants (sigrEA84) , expected to contain a 
sequence in which the region encoding the cr E amino 
terminus is positioned immediately downstream of the P 31 
initiation codon and ribosome-binding site. This was 
verified by sequencing the splice regions of 
constructions that were positive by the restriction 
endonuclease criteria. 

The splice region of one of the clones diverged from 
the parental sequence only after the initiating ATG 
codon. At that point, the sequence encoding the o E amino 
terminus began in frame with the ATG codon. 

The 1.02-kbp FstJ fragment carrying the sigEA84 
allele was cloned into unique PstJ sites of the plasmid 
vectors pJM102 and pSI-1 to form pJMA84 and pSlA84, 
respectively. The PstI site of each of these plasmids 
lay downstream of an IPTG 

(isopropylthiogalactopyranoside) -inducible promoter) Plac 
in pJM102 and Pspac in pSI-1) . Both plasmids replicated 
in E. coli, but only pSI-l replicated in J3. subtilis. 
The sigEA84 gene was cloned in the proper orientation for 
expression from the inducible promoters as verified by 
restriction endonuclease analysis of the ability of this 
gene to specify a product in E. coli. 
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EXAMPLE I 

Production of Sigma Protein in Escherichia coli 

Cultures of E. coli which carried either the mutant 
or wild-type sigE allele under Plac or Pspac control were 
induced with IPTG. The amount of P 31 or a E A84 present at 2 
h after induction was determined by Western immunoblot 
analysis. An autoradiogram of the results of this 
experiment is presented in Fig. 3. Significant amounts 
of protein that reacted with the anti-P 31 /a E antibody were 
present in all of the extracts. The extracts from 
strains carrying the sigrEA84 allele synthesized a protein 
(a E A84) with the apparent mobility of a E , while the 
wild-type allele specified a protein with the mobility of 
P 31 . In addition to the a E A84 band, extracts prepared from 
the sigEA84 strains contained one or more prominent 
lower-molecular-weight proteins that reacted with the 
antibody. The lower-molecular-weight proteins varied in 
abundance in different extract preparations and probably 
represent breakdown products of a E . They were similar in 
size to but of greater intensity than bands when 
analyzing extracts of B. subtilis cells that synthesize 
P 31 and a E (Jonas et al., 1989). Densitometry measurements 
of radioactively labeled antibody bound to the Western 
blots revealed that the amount of the antibody fixed to 
the two major bands seen in the sigEA84 allele encodes a 
protein with the apparent molecular weight of a E but the 
sigEA84 product accumulated to a level only 10 to 2 0% of 
that obtained from the wild-type sigE allele in E. coli. 

EXAMPLE 2 

Effect of the siaEA84 allele on B. subtilis 



JH642 was transformed with pJMA84 . Cells that had 
integrated this nonreplicating plasmid into their 
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chromosomes were selected by chloramphenicol resistance. 
Recombination between pJMA84 and its homologous sequences 
on the B. subtllis chromosome could occur either upstream 
or downstream of the A84 deletion. Recombination 
5 upstream of the deletion would place the mutant allele 

under the control of the spoIIG promoter, allowing it to 
be expressed. A recombination event downstream of the 
mutation would leave a wild-type allele as the expressed 
copy. The A84 deletion lay approximately 200 bp from the 
10 upstream end of the 1.02-kpb fragment. It was 
anticipated that approximately 20% of the 
chloramphenicol-resistant transformants should have the 
mutant allele as the expressed copy under conditions of 
random recombination. 

15 

Of the 497 transformants receiving pJMA84, 127 (25%) 
were Spo" (i.e., colonies remained white and began to lyse 
after 24 h at 37°C on DS medium) . In a similar 
experiment in which the transforming DNA was pJM102 with 

20 a wild-type sigrE allele (PJME-1) , no Spo - transformants 

were observed (O of 463). Southern blot analysis of four 
Spo* and four Spo" clones which had been transformed with 
pJMA84 revealed the wild-type sigE allele as part of the 
spoIIG operon in the Spo + clones and the sigEA84 allele 

25 as the operon component in the Spo' clones. 

The sigE gene of JH642 was replaced with sigEA84 by 
transforming JH642 (Phe) with SMY chromosomal DNA (Phe + ) 
plus plasmid pJMA84. Phe + transformants were screened 

30 for Spo" clones. The chromosomal DNAs were then examined 
by Southern blot analysis. These DNAs contained a 
PstJ-generated fragment of 1.03 kbp rather than the 
parental fragment of 1.1 kpb, which hybridized to a 
sigrE-specif ic DNA probe. One of these clones (SEA84-1) 

35 was characterized further. 



WO 93/03156 



PCT/US92/05810 



-15- 

The activity of the a E in clone a E A84-l was analyzed 
by using a reporter gene (lacZ) fused to a a E -dependent 
promoter (spoIID) (Rong et al . , 1986). SEA84-1 carrying 
a spoIID: :lacZ fusion made virtually no 0-galactosidase 
during growth or sporulation in DS medium. Western blot 
analysis failed to detect any o E at times when a E is 
normally abundant. No sigma-type protein was detected in 
B . subtilis when an attempt was made to visualize its 
product by immunoprecipitation of a radiolabeled product 
or induction in B. subtilis from an spac promoter. Both 
of these methods generate readily detectable P 31 and a E 
from the wild-type sigE allele. 

EXAMPLE 3 

Translational Fusions to siaE 

A translation fusion between sigE alleles and the E . 
coli lacZ gene was constructed. 

As described in the Materials and Methods section, a 
340-bp fragment containing upstream DNA and the coding 
sequence for the first 55 amino acids of P 31 was cloned 
into a plasmid vector (pSGMU3lA2) so as to create an 
in-frame fusion between P 31 and the E. coli lacZ gene. A 
similar fusion was created by using the analogous DNA 
fragment (256 bp) from sigrEA84. The coding sequence for 
the first 2 6 amino acids from the sigrEA84 gene was fused 
to lacZ. The vector used was incapable of replication in 
B. subtilis but carried an antibiotic resistance gene 
(cat) expressible in B . subtilis. Thus, transformation 
of B. subtilis with the fusion plasmids followed by 
selection for chloramphenicol resistance yielded clones 
in which the plasmid was integrated into its chromosomal 
homolog. Integration of the plasmid resulted in a Spo' 
phenotype owing to the positioning of the sigE: zlacZ 
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f us ion gene within the spoIIG operon in place of the 
wild- type allele. 

When the wild-type sigEzzlacZ fusion plasmid was 
introduced into a sigE + B. subtilis strain, all of the 
clones (38 of 38) displayed a Lac + phenotype on plates 
(Table 1). In a similar experiment, only 25% (8 of 32) 
of the transformants receiving the sigrEA84: :IacZ DNA were 
Lac + . As was the case in the previous transformation 
experiment, the integration event of : :2acZ into sigE 
could take place either upstream or downstream of the A84 
mutation. Recombination downstream of the mutation would 
result in creation of a wild- type sigEztlacZ gene. It 
was therefore possible that the Lac + clones represented 
sigE + ::IacZ fusions and that sigEABAxilacZ clones were 
Lac". A Southern blot analysis of the chromosomes of a 
representative sample of Lac + and Lac" transformants that 
had received sigEABA: zlacZ DNA was performed. Six of six 
Lac + clones had a DNA fragment of the size anticipated 
for wild-type sigE upstream of the lacZ gene, and six of 
six of the Lac" clones had a fragment of the size expected 
for sigrEA84 upstream of lacZ. 

To verify that a Lac' phenotype is a consequence of 
the sigrEA84 sequence being upstream of the lacZ gene the 
transformation into SE84-1, a strain which already 
carries the sigEA84 allele was repeated. In these 
experiments, where the creation of a wild-type 
sigEA84: :2acZ fusion was impossible, all of the 
chloramphenicol-resistant transformants that received the 
sigEA84: :IacZ DNA were Lac" (25 or 25), while 
approximately 15% (4 of 27) of the transformants which 
received the sigEABA: ilacZ DNA were Lac". Analysis of the 
level of B-galactosidase in extracts prepared from 
sigEA84 : zlacZ transformants that were phenotypically Lac" 
on plates failed to reveal measurable levels of 
B-galactosidase. Thus, synthesis of the fusion protein 
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mirrors the inability of sigEA84 to form a E itself. In 
both cases, no product was detectable by the assay 
systems employed. 

5 TABLE 1 



Transformation of B. subtilis by sigEizlacZ DNAs* 













No. 


of colonies 


10 


Recipient 


Donor DNA 














fusion 


Total 


Spo' 


Spo" 


Lac* Lac 








Examined 










JH642 


sigE: :lacZ 


38 


38 


0 


38 0 


15 




sigEA84 : xlacZ 


32 


32 


0 


8 24 




SE84 


sigE: tlacZ 


27 


27 


0 


23 4 






sigEA84 : :lacZ 


25 


25 


0 


0 25 



■ Competent recipient cells were transformed with plasmid 
20 DNA. Transformants were selected on LB plates 

supplemented with chloramphenicol (5 nqml) . Cm r clones 
were replica plated onto DS plates (22) with and 
without an agar overlay containing X-gal 
( 5-bromo-4 -chloro-3 -indoly l-B-/d- 
25 thiogalactopyranoside) (0.5 mg/ml) (18) . Spo + colonies 

turned brown after 24 hr at 37°C on DS plates, while 
Spo* colonies remained white. Lac* colonies turned blue 
on DS with X-gal by 18 hr at 37 °C. 



The cloned sigE gene was mutagenized by using an 
oligonucleotide (30-mer) that would generate an altered 
sigEA84 with a Lys codon (AAA) between the initiation 

3 5 codon (AUG) and the codon specifying the cr E amino terminus 

(GGC) . Mutant clones were isolated, and their structures 
were verified by DNA sequencing. 

When introduced into B . subtilis, sigrEA81 behaved 

4 0 identically to sigEA84. It conferred a Spo" phenotype on 

cells which carried it (i.e., when sigrEA84 was cloned 
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into pJM102 and transformed into JH642, approximately 20% 
of the chloramphenicol-resistant transf ormants were Spo) , 
and cells into which the sigEABl allele had been 
congressed synthesized no detectable a E A81 protein. Thus, 
the addition of AAA at position two of sigEA81 yielded no 
measurable increase in a E levels. 

During the course of the mutagenesis protocol a sigE 
mutant was generated (sigEA48) that encoded a protein 
with the four amino acids of the amino terminus of 
sigEASl joined by an Arg residue to residue 22 of P 31 . 
The sigEA48 mutation was initially identified by the 
intermediate size of the PstI DNA fragment which carried 
it. Its structure was determined by DNA sequencing. 

When sigEA48 was moved into B. subtilis, the cells 
became Spo". However, unlike sigEA84 and sigEA81, sigEA48 
synthesized an active o^-like protein. The sigEA4B 
product was seen in Western blots, and its activity was 
evident by the activation of the spoIID promoter. The 
activation of the spoIID promoter by sigEA48 does not 
apparently require the processing of a E A48 into the 
wild-type a E protein. Transcription of spoIID occurred in 
B. subtilis strains carrying sigEA48 even if the strain 
had a second mutation in a gene (spoIIGA) that is 
essential for processing. In addition, no change was 
detected in the mobility of a E A48 at times in sporulation 
(t 4 ) when the processing activity should be present. 
Thus, a E A48 is both insensitive to processing and active 
without processing. The level of product present in the 
sigEA48 strain was comparable to that found in the 
wild-type sigE strain. 

Extracts were produced from B . subtilis strains lacking 
a chromosomal copy of sigE but containing plasmid-borne 
copies of either the wild-type (sigE + ) or one of the 
mutant alleles of sigE (pSR51 or pSR54) . Western blot 
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analysis for proteins of the size of either P 31 or a E -sized 
proteins in this system indicated no detectable proteins 
in the extracts prepared from the strains with the mutant 
sigE alleles. This result was the same obtained with the 
5 sigEA84 and sigEABl alleles. 

fl-cralactosidase Fusion Proteins 

10 

B . subtilis strains expressing 0-galactosidase fusion 
proteins were constructed as described (Jonas et al . , 
1990) . The E. coli lacZ gene was translationally coupled 
to sigE by cloning a 340-bp Pstl-SauIIIAl fragment of the 

15 1.1 kbp cloned sigE gene into Pstl-BamHI-cut pSGMU3lA2. 
This coupled the 5' end of lacZ in frame to sigE at a 
site 165 bp downstream of the sequence encoding the amino 
terminus of P 31 . In a similar fashion, the bp segment 
encoding the polypeptide having the amino acid sequence 

20 of Figure 1 was cloned into this vector. This construct 
was formed at the same site within sigE but had only the 
base pairs shown in Figure 2 of the sigE protein-coding 
region upstream of lacZ. JH642 was transformed with 
PSGMU31A2 containing the coding sequence for either the 

25 sigE + or si gEA8 4 -encoded amino terminus. Clones in which 

the plasmid had integrated into the B. suJbtilis 
chromosome were selected by chloramphenicol resistance 
(2.5 /ig/ml) and screened by Southern blot analysis for 
integration into the sigE locus with the proper sigE 

30 allele positioned upstream of lacZ. 

The half-life of the fusion proteins expressed from 
transformed Escherichia coli or B . subtilis host cells 
were measured. Table 2 shows the results. 

35 
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Protein Microorganism Half Life 



Sigma E 


£. 


coli 


ND 




0-galactosidase 


E. 


coli 


30 


min 


0-galactosidase 


B. 


suJbtilis 


ND 




Sigma E:SS' 


£. 


coli 


>2 


hr 


/S-galactosidase: SS 1 


£. 


coli 






/3-galactosidase : SS l 


B. 


subtilis 


>2 


hr 



1 SS is the 29AA sequence shown in Figure 1. 

15 EXAMPLE 5 

Structural Perturbations in Prot ein Stabilization 
Sequence 

20 The effect of perturbation of cc-helical structure of 

the protein stabilization sequence shown in Figure 1 was 
examined by altering the primary amino acid sequence. 

The DNA sequence shown in Figure 2 was altered by 
25 insertion of an arginine codon at position 10, resulting 
in a new sequence encoding a 30-amino acid polypeptide. 
In order to effect the insertion, a restriction site was 
created at codon position 11 by substituting codon CTA 
for CTC. The base pair segment coding for the 30-amino 
30 acid polypeptide and a gene segment encoding 0- 

galactosidase were cloned. Expression of j8-galactosidase 
from a B . subtilis bacterial host cell was examined by 
Western blot. Amounts of expressed fusion polypeptide 
were less than measured in systems utilizing the 2 9 -amino 
35 acid stabilization sequence. 
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The hydrophobic and lv'drophilic faces of the a-helix 
formed by the 30-amino aciu polypeptide were twisted 
relative to each other by about 100°. 

Additional insertions into the DNA sequence encoding 
the 30 amino acid polypeptide were made. Codons CTC and 
GCC encoding leucine and alaline were inserted between 
codon 10 and codon 11 of the base sequence, resulting in 
a 180° twist of the polypeptide a-helix structure as 
compared with the a-helix of the 29 amino acid 
polypeptide structure. The new 32 amino acid polypeptide 
was expressed as a fusion polypeptide with 0- 
galactosidase from a transformed B . subtilxs host. 
Expression was less efficient than with fusions formed 
from either the 29 or the 3 0 amino acid protein 
stabilizing sequence, as indicated by Western blot 
analysis. 
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CLAIMS 

1. A protein stabilization sequence comprising an amino 
acid sequence attached to a protein to be stabilized, 
said sequence including a number of amino acid residues 
sufficient to form a structure having an outwardly 
directed hydrophobic face and a positively charged polar 
face, 

2. The protein stabilization sequence of claim 1 
wherein the structure is an ot-helix. 

3. The protein stabilization sequence of claim 1 having 
up to 100 amino acid residues. 

4. The protein stabilization sequence of claim 1 
wherein the outwardly directed hydrophobic face is in 
alignment with the positively charged face. 

5. The protein stabilization sequence of claim 1 
wherein the amino acid sequence is from about 10 to 
about 50 amino acid residues. 

6. The protein stabilization sequence of claim 1 
wherein the amino acid sequence is 29 amino acid 
residues. 

7. The protein stabilization sequence of claim 6 
wherein the sequence comprises the sequence of Figure 1 
or a biologically functional equivalent thereof. 
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8. A method of stabilizing proteins comprising 
attaching the protein stabilization sequence of claim 1 
to a proteolytically sensitive protein. 

9. The method of claim 8 wherein the attaching is to an 
amino terminal end of a proteolytically sensitive 
protein. 

10. The method of claim 8 wherein the attaching is to a 
carboxy terminal end of a proteolytically sensitive 
protein. 

11. The method of claim 8 wherein the attaching is to a 
proteolytically sensitive protein at both terminii. 

12. The method of claim 8 wherein the proteolytically 
sensitive protein is RNA polymerase sigma factor. 

13. The method of claim 12 wherein the RNA polymerase 
sigma factor is from £. subtilis or E. coli. 

14. A method of enhancing stability of recombinant 
proteins comprising 

obtaining a gene segment encoding the protein 
stabilization sequence of claim 1; 

preparing a recombinant vector having a gene 

encoding the protein stabilization sequence and 
a gene encoding a protein to be stabilized; 
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transforming a host cell with the recombinant 
vector ; 

selecting transformed host cell colonies; and 

facilitating expression of a heterologous protein 
fused N- terminally with the protein 
stabilization sequence. 

15. The method of claim 14 wherein the protein to be 
stabilized is j8-galactosidase. 

16. The method of claim 14 wherein the host cell is a 
bacterium. 



17. The method of claim 14 wherein the host cell is B . 
subtilis or E, coli. 



18. A system for production of a stable, heterologous 
protein in prokaryotic hosts comprising DNA having a 
coding sequence for the protein stabilization sequence of 
claim 1 operably linked to and out of reading frame with 
a terminal leader DNA sequence encoding a protein 
expressible as an active fusion protein. 

19. A DNA segment which codes for the protein 
stabilization sequence of claim 1 or a functionally 
stabilizing equivalent thereof. 
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20. The DNA segment of claim 19 having the base pair 
sequence of Figure 2 or a functionally stabilizing 
structural equivalent thereof. 

21. A recombinant DNA vector comprising a first base 
sequence coding for the protein stabilization sequence of 
claim 1 or a second base sequence complementary to the 
first base sequence. 
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